Concept-to-speech synthesis by phonological structure matching
نویسندگان
چکیده
منابع مشابه
Concept-to-Speech Synthesis by Phonological Structure Matching
This paper presents a new way of generating synthetic speech waveforms from a linguistic description. The algorithm is presented as a proposed solution to the speech generation problem in a concept-to-speech system. Off-line, a database of recorded speech is annotated so as to produce a phonological tree for each sentence in that database. Synthesis is performed by generating a phonological tre...
متن کاملSpeech synthesis by phonological structure matching
This paper presents a new technique for speech synthesis by unit selection. The technique works by specifying the synthesis target and the speech database as phonological trees, and using a selection algorithm which finds the largest parts of trees in the database which match parts of the target tree. The technique avoids many of the errors made by prosody generation modules by incorporating th...
متن کاملFrom Information Structure to Intonation: A Phonological Interface for Concept-to-Speech
The paper describes an interface between generator and synthesizer of the German language concept-to-speech system VieCtoS. It discusses phenomena in German intonation that depend on the interaction between grammatical dependencies (projection of information structure into syntax) and prosodic context (performancerelated modifications to intonation patterns). Phonological processing in our syst...
متن کاملMultilingual phonological analysis and speech synthesis
We give an overview of multilingual speech synthesis using the IPOX system. The first part discusses work in progress for various languages: Tashlhit Berber, Urdu and Dutch. The second part discusses a multilingual phonological grammar, which can be adapted to a particular language by setting parameters and adding languagespecific details.
متن کاملConcept-to-speech generation by integrating syntagmatic features into HMM-based speech synthesis
In conventional concept-to-speech (CTS) methods, a common step is predicting abstract prosodic descriptions, such as the locations of accents and phrase boundaries, from the linguistic information provided by the text generation module. But the prediction results always contain errors, and unacceptable prosodic prediction may ruin the synthesized speech. In addition, linguistic information, whi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Philosophical Transactions of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences
سال: 2000
ISSN: 1364-503X,1471-2962
DOI: 10.1098/rsta.2000.0594